In C++ computer programming, allocators are an important component of the C++ Standard Library. The standard library provides several data structures, such as list and set, commonly referred to as containers. A common trait among these containers is their ability to change size during the execution of the program. To achieve this, some form of dynamic memory allocation is usually required. Allocators handle all the requests for allocation and deallocation of memory for a given container. The C++ Standard Library provides general-purpose allocators that are used by default, however, custom allocators may also be supplied by the programmer.
Allocators were invented by Alexander Stepanov as part of the Standard Template Library (STL). They were originally intended as a means to make the library more flexible and independent of the underlying memory model, allowing programmers to utilize custom pointer and reference types with the library. However, in the process of adopting STL into the C++ standard, the C++ standardization committee realized that a complete abstraction of the memory model would incur unacceptable performance penalties. To remedy this, the requirements of allocators were made more restrictive. As a result, the level of customization provided by allocators is more limited than was originally envisioned by Stepanov.
Nevertheless, there are many scenarios where customized allocators are desirable. Some of the most common reasons for writing custom allocators include improving performance of allocations by using memory pools, and encapsulating access to different types of memory, like shared memory or garbage-collected memory. In particular, programs with many frequent allocations of small amounts of memory, may benefit greatly from specialized allocators, both in terms of running time and memory footprint.
Contents |
Alexander Stepanov and Meng Lee presented the Standard Template Library to the C++ standards committee in March 1994.[1] The library received preliminary approval, although a few issues were raised. In particular, Stepanov was requested to make the library containers independent of the underlying memory model,[2] which led to the creation of allocators. Consequently, all of the STL container interfaces had to be rewritten to accept allocators.
In adapting STL to be included in the C++ Standard Library, Stepanov worked closely with several members of the standards committee, including Andrew Koenig and Bjarne Stroustrup, who observed that custom allocators could potentially be used to implement persistent storage STL containers, which Stepanov at the time considered an "important and interesting insight".[2]
From the point of view of portability, all the machine-specific things which relate to the notion of address, pointer, and so on, are encapsulated within a tiny, well-understood mechanism.
—Alex Stepanov, designer of the Standard Template Library |
The original allocator proposal incorporated some language features that had not yet been accepted by the committee, namely the ability to use template arguments that are themselves templates. Since these features could not be compiled by any existing compiler, there was, according to Stepanov, "an enormous demand on Bjarne [Stroustrup]'s and Andy [Koenig]'s time trying to verify that we were using these non-implemented features correctly."[2] Where the library had previously used pointer and reference types directly, it would now only refer to the types defined by the allocator. Stepanov later described allocators as follows: "A nice feature of STL is that the only place that mentions the machine-related types (...) is encapsulated within roughly 16 lines of code."[2]
While Stepanov had originally intended allocators to completely encapsulate the memory model, the standards committee realized that this approach would lead to unacceptable efficiency degradations.[3][4] To remedy this, additional wording was added to the allocator requirements. In particular, container implementations may assume that the allocator's type definitions for pointers and related integral types are equivalent to those provided by the default allocator, and that all instances of a given allocator type always compare equal,[5][6] effectively contradicting the original design goals for allocators.[4]
Stepanov later commented that, while allocators "are not such a bad ideas in theory (...) [u]nfortunately they cannot work in practice". He observed that to make allocators really useful, a change to the core language with regards to references was necessary.[7]
Any class that fulfills the allocator requirements can be used as an allocator. In particular, a class A
capable of allocating memory for an object of type T
must provide the types A::pointer
, A::const_pointer
, A::reference
, A::const_reference
, and A::value_type
for generically declaring objects and references (or pointers) to objects of type T
. It should also provide type A::size_type
, an unsigned type which can represent the largest size for an object in the allocation model defined by A
, and similarly, a signed integral A::difference_type
that can represent the difference between any two pointers in the allocation model.[8]
Although a conforming standard library implementation is allowed to assume that the allocator's A::pointer
and A::const_pointer
are simply typedefs for T*
and T const*
, library implementors are encouraged to support more general allocators.[9]
An allocator, A
, for objects of type T
must have a member function with the signature A::pointer A::allocate(size_type n, A<void>::const_pointer hint = 0)
. This function returns a pointer to the first element of a newly allocated array large enough to contain n
objects of type T
; only the memory is allocated, and the objects are not constructed. Moreover, an optional pointer argument (that points to an object already allocated by A
) can be used as a hint to the implementation about where the new memory should be allocated in order to improve locality.[10] However, the implementation is free to ignore the argument.
The corresponding void A::deallocate(A::pointer p, A::size_type n)
member function accepts any pointer that was returned from a previous invocation of the A::allocate
member function and the number of elements to deallocate (but not destruct).
The A::max_size()
member function returns the largest number of objects of type T
that could be expected to be successfully allocated by an invocation of A::allocate
; the value returned is typically A::size_type(-1) / sizeof(T)
.[11] Also, the A::address
member function returns an A::pointer
denoting the address of an object, given an A::reference
to it.
Object construction and destruction is performed separately from allocation and deallocation.[11] The allocator is required to have two member functions, A::construct
and A::destroy
, which handles object construction and destruction, respectively. The semantics of the functions should be equivalent to the following:[8]
template <typename T> void A::construct(A::pointer p, A::const_reference t) { new ((void*) p) T(t); } template <typename T> void A::destroy(A::pointer p){ ((T*)p)->~T(); }
The above code uses the placement new
syntax, and calls the destructor directly.
Allocators should be copy-constructible. An allocator for objects of type T
can be constructed from an allocator for objects of type U
. If an allocator, A
, allocates a region of memory, R
, then R
can only be deallocated by an allocator that compares equal to A
.[8]
Allocators are required to supply a template class member template <typename U> struct A::rebind { typedef A<U> other; };
, which enables the possibility of obtaining a related allocator, parametrized in terms of a different type. For example, given an allocator type IntAllocator
for objects of type int
, a related allocator type for objects of type long
could be obtained using IntAllocator::rebind<long>::other
.[11]
One of the main reasons for writing a custom allocator is performance. Utilizing a specialized custom allocator may substantially improve the performance or memory usage, or both, of the program.[4][12] The default allocator uses operator new
to allocate memory.[13] This is often implemented as a thin layer around the C heap allocation functions,[14] which are usually optimized for infrequent allocation of large memory blocks. This approach may work well with containers who mostly allocate large chunks of memory, like vector and deque.[12] However, for containers that require frequent allocations of small objects, such as map and list, using the default allocator is generally slow.[4][14] Other common problems with a malloc-based allocator include poor locality of reference,[4] and excessive memory fragmentation.[4][14]
In short, this paragraph (...) is the Standard's "I have a dream" speech for allocators. Until that dream becomes common reality, programmers concerned about portability will limit themselves to custom allocators with no state
—Scott Meyers, Effective STL |
A popular approach to improve performance is to create a memory pool-based allocator.[12] Instead of allocating memory every time an item is inserted or removed from a container, a large block of memory (the memory pool) is allocated beforehand, possibly at the startup of the program. The custom allocator will serve individual allocation requests by simply returning a pointer to memory from the pool. Actual deallocation of memory can be deferred until the lifetime of the memory pool ends. An example of memory pool-based allocators can be found in the Boost C++ Libraries.[12]
The subject of custom allocators has been treated by many C++ experts and authors, including Scott Meyers in Effective STL and Andrei Alexandrescu in Modern C++ Design. Meyers observes that the requirement for all instances of an allocator to be equivalent in effect forces portable allocators to not have state. Although the C++ Standard does encourage library implementors to support stateful allocators,[9] Meyers calls the relevant paragraph "a lovely sentiment" that "offers you next to nothing", characterizing the restriction as "draconian".[4]
In The C++ Programming Language, Bjarne Stroustrup, on the other hand, argues that the "apparently [d]raconian restriction against per-object information in allocators is not particularly serious",[3] pointing out that most allocators do not need state, and have better performance without it. He mentions three use cases for custom allocators, namely, memory pool allocators, shared memory allocators, and garbage collected memory allocators. He presents an allocator implementation that uses an internal memory pool for fast allocation and deallocation of small chunks of memory, but notes that such an optimization may already be performed by the allocator provided by the implementation.[3]
Another viable use of custom allocators is for debugging memory-related errors.[15] This could be achieved by writing an allocator that allocates extra memory in which it places debugging information.[16] Such an allocator could be used to ensure that memory is allocated and deallocated by the same type of allocator, and also provide limited protection against overruns.[16]
When instantiating one of the standard containers, the allocator is specified through a template argument, which defaults to std::allocator<T>
:[17]
namespace std { template <class T, class Allocator = allocator<T> > class vector; // ...
Like all C++ class templates, instantiations of standard library containers with different allocator arguments are distinct types. A function expecting an std::vector<int>
argument will therefore only accept a vector
instantiated with the default allocator.